MIT  LIBRARIES 


3  9080  02618  0908 


.M415 


Digitized  by  the  Internet  Archive 

in  2011  with  funding  from 

Boston  Library  Consortium  Member  Libraries 


http://www.archive.org/details/admissibleinvariOOcher 


HB31 
.M415 


DEWEV 


Massachusetts  Institute  of  Technology 

Department  of  Economics 

Working  Paper  Series 


ADMISSIBLE  INVARIANT  SIMILAR  TESTS  FOR 
INSTRUMENTAL  VARIABLES  REGRESSION 

Victor  Chemozhukov 
Christian  Hansen 
Michael  Jansson 


Working  Paper  07-09 
March  12,  2007 


Room  E52-251 

50  Memorial  Drive 

Cambridge,  MA  02142 


This  paper  can  be  downloaded  without  charge  from  the 

Social  Science  Research  Network  Paper  Collection  at 

http://ssrn.com/abstract=97061 6 


Admissible  Invariant  Similar  Tests  for  Instrumental  Variables  Regression* 

Victor  Chernozhukov  Christian  Hansen 

Department  of  Economics         Graduate  School  of  Business 

M.I.T.  University  of  Chicago 

Michael  Jansson 

Department  of  Economics 

UC  Berkeley 

March  12,  2007 


Abstract.  This  paper  studies  a  model  widely  used  in  the  weak  instru- 
ments literature  and  establishes  admissibility  of  the  weighted  average  power 
likelihood  ratio  tests  recently  derived  by  Andrews,  Moreira,  and  Stock  (2004). 
The  class  of  tests  covered  by  this  admissibility  result  contains  the  Anderson 
and  Rubin  (1949)  test.  Thus,  there  is  no  conventional  statistical  sense  in  which 
the  Anderson  and  Rubin  (1949)  test  "wastes  degrees  of  freedom".  In  addition, 
it  is  shown  that  the  test  proposed  by  Moreira  (2003)  belongs  to  the  closure  of 
(i.e.,  can  be  interpreted  as  a  limiting  case  of)  the  class  of  tests  covered  by  our 
admissibility  result. 

1.  Introduction 
Conducting  valid  (and  preferably  "optimal")  inference  on  structural  coefficients  in 
instrumental  variables  (IVs)  regression  models  is  known  to  be  nontrivial  when  the 
IVs  are  weak.^  Influential  papers  on  this  subject  include  Dufour  (1997)  and  Staiger 
and  Stock  (1997),  both  of  which  highhght  the  inadequacy  of  conventional  asymptotic 
approximations  to  the  behavior  of  two-stage  least  squares  and  point  out  that  valid  in- 
ference can  be  based  on  the  Anderson  and  Rubin  (1949,  henceforth  AR)  test.  Several 
methods  intended  to  enjoy  improved  power  properties  relative  to  the  AR  test  have 
been  proposed,  prominent  examples  being  the  conditional  likelihood  ratio  (CLR)  and 
Lagrange  multiplier  (LM)  tests  of  Moreira  (2003)  and  Kleibergen  (2002),  respectively. 
All  of  the  abovementioned  methods  and  results  have  been  or  can  be  deduced  within 
an  IV  regression  model  with  a  single  endogenous  regressor,  fixed  (i.e.,  non-stochastic) 
IVs,  and  i.i.d.  homoskedastic  Gaussian  errors.  Studying  that  model,  Andrews,  Mor- 
eira, and  Stock  (2004,  henceforth  AMS)  obtain  a  family  of  tests,  the  so-called  weighted 

*The  authors  thank  Jim  Powell,  Paul  Ruud,  a  co-editor,  and  (in  particular)  two  referees  for  very 
valuable  comments. 

^Recent  reviews  include  Stock,  Wright,  and  Yogo  (2002),  Dufour  (2003),  Hahn  and  Hausman 
(2003),  and  Andrews  and  Stock  (2006). 
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average  power  likelihood  ratio  (WAP-LR)  tests,  each  member  of  which  enjoys  demon- 
strable optimality  properties  within  the  class  of  tests  satisfying  a  certain  invariance 
restriction.  The  invariance  restriction  in  question,  namely  that  inference  is  invariant 
to  transformations  of  the  minimal  sufficient  statistic  iS,T)  (defined  in  (3)  below) 
corresponding  to  a  rotation  of  the  instruments,  is  satisfied  by  the  AR,  CLR,  and 
LM  tests.  Furthermore,  testing  problems  involving  the  structural  coefficient  are  ro- 
tation invariant  under  the  distributional  assumptions  employed  by  AMS.  For  these 
reasons  (and  others),  the  rotation  invariance  restriction  seems  "natural",  in  which 
case  AMS's  numerical  finding  that  the  CLR  test  is  "nearly"  efficient  relative  to  the 
class  of  rotation  invariant  tests  provides  strong  evidence  in  favor  of  the  CLR  test. 

Nevertheless,  because  best  invariant  procedures  can  fail  to  be  admissible  even  if 
they  exist,  it  is  not  entirely  obvious  whether  it  is  "natural"  to  confine  attention  to 
rotation  invariant  tests  when  developing  optimality  theory  for  hypothesis  tests  in  the 
model  of  AMS.  In  particular,  it  would  appear  to  be  an  open  question  whether  the 
members  of  the  WAP-LR  family,  upon  which  the  construction  of  the  two-sided  power 
envelope  of  AMS  is  based,  are  even  admissible  (in  the  Gaussian  model  with  fixed  IVs). 
We  show  that  all  members  of  the  WAP-LR  family  are  indeed  admissible,  essentially 
because  the  defining  optimality  property  of  these  tests  can  be  reformulated  in  such  a 
way  that  rotation  invariance  becomes  a  conclusion  rather  than  an  assumption.  The 
AR  test  belongs  to  the  WAP-LR  family  and  is  therefore  admissible.  In  contrast, 
the  CLR  and  LM  tests  do  not  seem  to  admit  WAP-LR  representations,  though  we 
demonstrate  here  that  these  test  statistics  "nearly"  admit  WAP-LR  interpretations 
in  the  sense  that  they  belong  to  the  closure  (appropriately  defined)  of  the  class  of 
WAP-LR  tests. 

Section  2  introduces  the  model  and  defines  some  terminology  needed  for  the  de- 
velopment of  the  formal  results  of  the  paper,  all  of  which  are  stated  in  Section  3  and 
proven  in  Section  4.  -  ■ 


Consider  the  model 


Preliminaries 


y2    =    Ztt  +  V2,  (1) 


where  yi,  y2  S  IR"  and  Z  G  M"''''  are  observed  variables  (for  some  A;  >  2);  ^  G  K  and 
TT  G  M''  are  unknown  parameters;  and  u,  V2  G  M"  are  unobserved  errors.^ 

^It  is  straightforward  to  accommodate  exogenous  regrcssors  in  (1) .  We  tacitly  assume  that  any 
such  regressors  have  been  "partialed  out".  This  assumption  is  made  for  simplicity  and  entails  no 
loss  of  generality  (for  details,  see  Section  2  of  AMS). 
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Suppose  P  is  the  parameter  of  the  interest.  Specifically,  suppose  we  are  interested 
in  a  testing  problem  of  the  form 


//o  :  /3  =  /?o         vs.         H,:p^  /?„. 
Writing  the  model  in  reduced  form,  we  have: 


Hi     =     ZttP  +  Vi, 

y2    =    Z7r  +  V2,  '•  .         (2) 


where  vi  =  u  +  V20-  Following  AMS  (and  many  others),  we  treat  Z  as  a  fixed  n  x  k 
matrix  with  full  column  rank  and  we  assume  that  {v[,V2)'  ~  A^(0,  fi  ®  /„) ,  where  Q 
is  a  known,  positive  definite  2x2  matrix.  Without  loss  generality  we  normalize  a 
variety  of  unimportant  constants  by  assuming  that  /?q  =  0,  Z'Z  =  Ii^,  and  that  fl  is 
of  the  form 

{6    1  + 

where  6  G  M  is  known. '^ 

Under  the  stated  assumptions,  the  model  is  fully  parametric,  the  (multivariate 
normal)  distribution  of  (yi,y2)  being  completely  specified  up  to  the  parameters  /3 
and  n.  A  minimal  sufficient  statistic  for  (3  and  tt  is  given  by 

-,  •    (r)  =  (z'fe'\,,)-^("'>«--^-)-  ,    ») 

where 

Because  (5,  T)  is  sufficient,  the  totality  of  attainable  power  functions  is  spanned 
•'These  assumptions  correspond  to  the  model  in  which  (Z,  j/i,j/2)  has  been  replaced  by 

where  uj,j  is  the  {i,j)  element  of  fi  and  0^22.1  =  "^22  —  '^r/'^i2-  ^^^  '■^e  chosen  parameterization, 
the  parameter  S  is  related  to  the  correlation  coefficient  p  computed  from  Cl  through  the  formula 
6  =  p/,/l-p'-. 
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by  the  set  of  power  functions  associated  with  (possibly  randomized)  tests  based  on 
{S,T) .  Any  such  test  can  be  represented  by  means  of  a  [0,  l]-valued  function  </>(■) 
such  that  Ho  is  rejected  with  probabiHty  4>  {s,  t)  if  {S,  T)  =  (s,  t) .  The  power  function 
of  this  test  is  the  function  (with  arguments  P  and  vr)  Ep^^^cf)  (5,  T) ,  where  the  subscript 
on  E  indicates  the  distribution  with  respect  to  which  the  expectation  is  taken. 
For  any  a  €  (0, 1) ,  a  test  with  test  function  0  is  of  level  a  if 

sup^Eo,^0(5,T)  <a.  (4) 

A  level  a  test  with  test  function  0  is  said  to  be  a-admissible  if 

E0^^<l>{S,T)<Ep,,^{S,T)         V(/3,7r)  (5) 

implies 

Ef,,^<P{S,T)  =  Ep,^ip{S,T)         V(/3,7r)  (6) 

whenever  the  test  associated  with  (p  is  of  level  a^ 

The  main  purpose  of  the  present  paper  is  to  investigate  the  a-admissibihty  prop- 
erties of  certain  recently  developed  rotation  invariant,  a-similar  tests.  By  definition, 
a  rotation  invariant  test  is  one  whose  test  function  4>  satisfies  </>  [OS,  OT)  =  (p  {S,  T) 
for  every  orthogonal  k  x  k  matrix  O  and  an  Of-similar  test  is  one  for  which 

.     -    E^,,(i^{S,T)  =  a        Vtt.        .         ::   ■.:..       :•,  (7) 

The  rotation  invariant,  a-similar  tests  under  consideration  here  are  the  WAP-LR 
tests  of  AMS.  By  construction,  a  size  a  WAP-LR  test  maximizes  a  WAP  criterion  of 
the  form  . 

/      £;^X5,r)dH/(/3,7r)  (8), 

among  rotation  invariant,  a-similar  tests,  where  the  weight  function  W  is  some  cu- 
mulative distribution  function  (cdf)  on  M!'^^  . 

In  spite  of  the  fact  that  the  defining  property  of  a  WAP-LR  test  is  an  optimality 
property,  it  is  not  obvious  whether  a  WAP-LR  is  a-admissible,  the  reason  being  that 

^In  the  model  under  study  here,  the  present  notion  of  Q-admissibihty  agrees  with  that  of  Lchmann 
and  Romano  (2005,  Section  6.7)  (in  which  (5)  and  (6)  are  required  to  hold  for  all  /?  7^  0  and  all  tt) 
because  (i)  all  power  functions  arc  continuous  and  (ii)  the  set  {/3  :  /3  7^  0}  is  a  dense  subset  of  R. 
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admissibility  of  optimum  invariant  tests  cannot  be  taken  for  granted  [e.g.,  Lehmann 
and  Romano  (2005,  Section  6.7)].  We  show  in  Section  3  that  any  WAP-LR  test  max- 
imizes a  WAP  criterion  among  the  class  of  a-similar  tests.  In  the  present  context, 
that  optimahty  property  is  sufficiently  strong  to  imply  that  any  WAP-LR  test  is  a- 
admissible. 

Remark.  The  fact  that  the  alternative  hypothesis  is  dense  in  the  maintained  hypoth- 
esis implies  that  any  a-admissible  test  is  d-admissible  in  the  sense  of  Lehmann  and 
Romano  (2005).  The  (rotation  invariant)  posterior  odds  ratio  tests  of  Chamberlain 
(2006)  are  d-admissible  almost  by  construction  [e.g.,  Lehmann  and  Romano  (2005, 
Theorem  6.7.2  (i))],  but  because  these  tests  are  not  necessarily  similar  it  is  unclear 
whether  these  tests  are  also  a-admissible  (for  some  a). 

3.     Results  ■ 

3.1.  Admissible  a-similar  tests.  Two  basic  facts  about  exponential  families 
greatly  simplify  the  construction  of  o-admissible,  a-similar  tests.  First,  because  the 
power  function  of  the  test  with  test  function  (p  can  be  represented  as 

E0,^cP{S,T)=  [     [   (j){s,t)fs{s\P,n)fT{t\P,n)dsdt,  (9) 

where  fs  {-{P,  tt)  and  fr  (-1/3,  vr)  denote  the  densities  (indexed  by  /?  and  n)  of  S  and 
T,  it  follows  from  Lehmann  and  Romano  (2005,  Theorem  2.7.1)  that  Ep^T,(j){S,T) 
is  a  continuous  function  of  (/3,  tt)  .  Therefore,  an  o-similar  similar  test  cannot  be 
dominated  by  a  level  a  test  which  is  not  a-similar.  By  implication,  an  a-similar  test 
with  test  function  (f)  is  a-admissible  if  (5)  implies  (6)  whenever  the  test  associated 
with  if  is  a-similar. 

Second,  because  tt  is  unrestricted,  T  is  a  complete,  sufficient  statistic  for  it  under 
Hq  [e.g.,  Moreira  (2001)].  As  a  consequence,  a  test  with  test  function  (f)  is  a-similar 
if  and  only  if  it  is  conditionally  a-similar  in  the  sense  that,  almost  surely, 

Eo,.[</)(5,T)|T]=a         Vvr.  (10) 

It  follows  from  the  preceding  considerations  and  the  Neyman-Pearson  lemma  that 
if  W  is  a  cdf  on  R''"+\  then  the  WAP  criterion  (8)  is  maximized  among  a-similar  tests 
by  the  test  with  test  function  given  by 


lW 


(s,  t;  a)  =  1  [LR^  (5,  t)  >  k^^  {t;  a)]  ,  ■  (11) 

where  1  [■]  is  the  indicator  function,  -  ' 
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^^     ^'''^"  fsis\0,0)frmO) ^'^^ 

and  KYii{t;(y)  is  the  I  -  a  quantile  of  the  distribution  of  LR^  {Z^J.) ,  where  Z^ 
is  distributed  J\f{0,lk)-^  Because  the  maximizer  cp^R  is  essentially  unique  (in  the 
measure  theoretic  sense),  the  test  with  test  function  (/"^(sQ;)  is  a-admissible.''  To 
demonstrate  a-admissibihty  of  a  test,  it  therefore  suffices  to  show  its  test  function 
can  be  represented  as  ^^  (•;«)  for  some  W.  Section  3.2  uses  that  approach  to  show 
that  the  WAP-LR  tests  of  AMS  are  all  a-admissible. 

3.2.  Admissibility  of  WAP-LR  tests.  The  WAP-LR  tests  are  indexed  by  cdfs 
on  R  X  R+.'  Accordingly,  let  u;  be  a  cdf  on  M  x  R+  and  define 


C-{s,t)=  I         exp(-^)   „A 


dw{p,X),  (13) 


where  qFi  is  the  regularized  confluent  hypergeometric  function, 


c(M)=(;;: ;;;).  (.4) 

and  7/^  is  defined  as  in  Section  2.   By  Corollary  1  of  AMS,  the  test  function  of  the 
size  a  WAP-LR  test  associated  with  w  is 

'/':^Ms(5,t;«)  =  l[£-(s,t)>«:^,,s(t;a)],  (15) 

where  k'^ms  (^5  '^)  i^  the  I  —  a  quantile  of  the  distribution  of  £"'  {Zk,  t)  .^'^ 

^Details  are  provided  in  an  Appendix,  available  from  the  authors  upon  request. 

^As  pointed  out  by  a  referee,  this  fact  is  a  special  case  of  the  more  general  decision  theoretic 
result  that  (essentially)  unique  Bayes  rules  are  admissible  [e.g.,  Ferguson  (1967,  Theorem  2.3.1)]. 

^Because  the  power  of  an  invariant  test  depends  on  n  only  through  the  scalar  tt'tt,  the  corre- 
sponding WAP  criterion  (8)  depends  on  W  only  through  the  cdf  on  R  x  1R+  given  by 


W(/?,A)   =     /  dW{b,TT). 

*£'"  {s,t)  is  proportional  to  ip^,  iqi,qT)  in  Lemma  1  of  AMS  because  qFi  (;  fc/2;  z/4)  is  propor- 
tional to  2~*''"^"''''*/(fc-2)/2  (\A) .  where  Jj,  (•)  denotes  the  modified  Bessel  function  of  the  first  kind 
of  order  i/. 

^The  size  a  WAP-LR  test  is  a-similar  by  construction  and  is  rotation  invariant  because 
'^AMS  (*;  ^)  depends  on  t  only  through  t't. 
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For  our  purposes,  it  is  convenient  to  characterize  the  defining  optimaUty  property 
of  (pjj^fs  as  follows.  Let  W^^s  denote  the  cdf  (on  R^+i)  of  (b,  y/KU'S  ,  where  [B,  A)' 
has  cdf  w  and  Uk  is  uniformly  distributed  on  the  unit  sphere  in  R*"  (independently 
of  B  and  A).  In  terms  of  W^j^^g,  Theorem  3  of  AMS  asserts  that  the  test  with  test 
function  (f>^MS  (s  ^)  maximizes 

E0,,cPiS,T)dW:^^siP^^) 

among  rotation  invariant,  a-similar  tests.  In  this  optimality  result,  the  assumption 
of  rotation  invariance  is  unnecessary  because  it  turns  out  that 

rAMs{s,t;a)  =  <p^t"is,t;a).  (16) 

The  displayed  equality  fohows  from  a  calculation  performed  in  the  proof  of  the  fol- 
lowing strengthening  of  Theorem  3  of  AMS. 

Theorem  1.  Let  w  be  a  cdf  on  M  x  R+.  If  (/>  satisfies  (7) ,  then 

f      E0,^<f^  (5,  T)  dW^^s  iP^  ^)<  [      Ep^^rAMS  (5,  T-  a)  dW^^s  (/?,  ^)  -. 

wlieve  the  inequahty  is  strict  unless  PiCff^.^  [0  [S,  T)  =  4>^ms  {S,  T;  a)]  =  1  for  some 
(and  hence  for  all)  (/?,  tt)  .  In  particular,  the  size  a  WAP-LR  test  associated  with  w 
is  a-admissible. 

The  testing  function  of  the  size  a  AR  test  (for  known  Q)  is  given  by 

4>^j,{s,t;a)  =  l[s's>xl{k)],  (17) 

where  xi  (^)  is  the  1  —  a  quantile  of  the  x^  distribution  -with  k  degrees  of  freedom.  As 
remarked  by  AMS,  £"'  (s,  t)  is  an  increasing  function  of  s's  whenever  the  weight  func- 
tion w  assigns  unit  mass  to  the  set  {(^5,  A)  G  R  x  R+  :  /?  =  6^^j  .  As  a  consequence, 
the  following  is  an  immediate  consequence  of  Theorem  1 . 

Corollary  2.   The  size  a  AR  test  is  a-admissible. 

In  spite  of  the  fact  that  the  AR  test  is  a  A:  degrees  of  freedom  test  appUed  to 
a  testing  problem  with  a  single  restriction,  a  fact  which  suggests  that  its  power 
properties  should  be  poor  [e.g.,  Kleibergen  (2002,  p.  1781),  Moreira  (2003,  p.  1031)], 
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Corollary  2  implies  that  there  is  no  conventional  statistical  sense  in  which  the  AR 
test  "wastes  degrees  of  freedom" . 

In  addition  to  the  AR  test,  Theorem  1  also  covers  the  two-point  optimal  invari- 
ant similar  tests  of  AMS,  the  power  functions  of  which  trace  out  a  two-sided  power 
envelope  for  rotation  invariant,  a-similar  tests.  On  the  other  hand,  the  CLR  and 
LM  tests  do  not  seem  to  belong  to  the  class  of  WAP-LR  tests.  Indeed,  it  would 
appear  to  be  an  open  question  whether  one  or  both  of  these  tests  even  belong  to  the 
closure  (appropriately  defined)  of  the  class  WAP-LR  tests.  Section  3.3  provides  an 
affirmative  answer  to  that  question. 

Remark.  As  pointed  out  by  a  referee,  two  distinct  generalizations  of  the  results 
of  this  section  seem  feasible.  First,  the  conclusion  that  a  Bayes  rule  corresponding 
to  a  distribution  ti;  on  R  x  R_,_  can  be  "lifted"  to  a  Bayes  rule  corresponding  to  a 
distribution  on  R'^+^  (by  introducing  a  Uk  which  is  uniformly  distributed  on  the  unit 
sphere  in  R'^  and  independent  of  (/3,  A))  appHes  to  more  general  decision  problems 
than  the  one  considered.  Second,  using  Muirhead  (1982,  Theorems  2.1.14  and  7.4.1) 
it  should  be  possible  to  generalize  the  results  to  a  model  with  multiple  endogenous 
regressors.  To  conserve  space,  we  do  not  pursue  these  extensions  here. 

3.3.      The  CLR  test.     The  test  function  of  the  size  a  CLR  test  is 

(l>CLR{s,t]a)  =  l[LR{s,t)>KcLR{t;a)],  (18) 

where 

LR  {s,  t)  =  ^  (s's  -  ft  +  ^{s's  +  t'tf  -  4  [{s's)  [ft)  -  {s'tf]  ]  (19) 

and  KcLR  [t]  ct)  is  the  1  —  a  quantile  of  the  distribution  of  LR  {Z^,  t) . 

In  numerical  investigations,  the  CLR  test  has  been  found  to  perform  remarkably 
well  in  terms  of  power.  For  instance,  AMS  find  that  the  power  of  the  size  a  CLR 
test  is  "essentially  the  same"  as  the  two-sided  power  envelope  for  rotation  invariant, 
Q-similar  tests.  In  hght  of  this  numerical  finding,  it  would  appear  to  be  of  interest  to 
analytically  characterize  the  relation  (if  any)  between  the  CLR  test  and  the  class  of 
WAP-LR  tests. 

Theorem  3  shows  that  CLR  (s,  t)  can  be  represented  as  the  hmit  as  A''  -^-  oo  of  a 
suitably  normalized  version  of  C"clr,n  (^^  {^  ^  where  {wclr,n  :  TV  G  N}  is  a  carefully 
chosen  collection  of  cdfs  on  R  x  R+ . 

To  motivate  this  representation  and  the  functional  form  of  wclr,n,  it  is  convenient 
to  express  the  test  function  of  the  CLR  test  as 
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(J)CLR  (s.  ^;  a)  =  1  [CLR*  (s,  t)  >  KcLR-  {t;  a) 
where 


CLR*{s,t)    =     \s's  +  t't+J{s's  +  t'tY-A[{s's){t't)-{s't) 


^2[LR{s,t)  +  t't]  (20) 


and  KcLR*  {t;  a)  =  ^/2[KcLR{t;a)  +t't]. 

The  statistic  CLR*  (s,  i)  is  the  square  root  of  the  largest  eigenvahie  of  Q  (s,  t) 
(defined  in  (14))  and  therefore  admits  the  following  characterization: 


CLR*  {s,  t)  =  A /max^eR2.^-^=i  -q'Q  (s,  t)  r).  (21) 


Moreover,  the  integrand  in  (13)  can  be  written  as 


exp ^      0^1 


where  Tjp  =  V/s/ x/v'pVp  i^  ^  vector  of  unit  length  proportional  to  7/^.  If  wclr.n  is  such 
that  its  support  is  the  set  of  all  pairs  (/3,  A)  for  which  Ary^r/^  =  A'',  then  the  integrand 
is  maximized  (over  the  support  of  wclr,n)  by  setting  7/^  equal  to  the  eigenvector 
associated  with  the  largest  eigenvalue  of  Q  (s,  t) .  This  observation,  and  the  fact  that 
the  tail  behavior  of  oi^i  [;  k/2;  •/4]  is  similar  to  that  of  exp  (i/^)  ,  suggests  that  the  large 
A^  behavior  of  C^clr.n  (^^  f)  "should"  depend  on  Q  (s,  t)  only  through  CLR*  {s,  t) . 

For  any  A^  >  0,  let  wclr,n  denote  the  cdf  of  [B,  N/^/rf^)' ,  where  ^  ~  A^  (0, 1)  .^^ 
By  construction,  wclr,n  is  such  that  its  support  is  the  set  of  all  pairs  (/?,  A)  for  which 
^v'ffVp  =  ^-  Using  that  property,  the  relation  (21) ,  and  basic  facts  about  o^^i  (;  k/'2;  ■) , 
we  obtain  the  following  result. 


^"^The  distributional  assumption  B  ~  TV  (0, 1)  is  made  for  concreteness.  An  inspection  of  tiie  proof 
of  Theorem  3  shows  that  (22)  is  valid  for  any  distribution  (of  B)  whose  support  is  R.  Furthermore, 
as  pointed  out  by  a  referee  it  is  possible  to  obtain  analogous  results  without  making  the  distribution 
of  ^VoVp  degenerate. 
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Theorem  3.  For  any  (s',t')'  G 


In  particular, 


CLR*  (s,  t)  =  lim7v-.oo  ^p= 


log  Z:""^^«''^  (s,  t)  + — 


(22) 


CLR{s,t)  =  lim^ 


2N 


logC"^'^"-^  {s,t)  + 


N 


t't. 


(23) 


Define  -C^^^^yv  («>*)  =  [log  £"'^^«"^  (s,i)  +  A^/2]  /\/iV.  The  proof  of  Theorem  3 
shows  that  as  A''  — >•  oo,  C*qiii^n  (')  converges  to  CLR*  (•)  in  the  topology  of  uniform 
convergence  on  compacta.  Using  this  result,  it  fohows  that 

Hmyv^oo  E0,,  I^Z-S'"  (5,  T-  a)  -  <Pclr  (S,  T;  a) |  =  0         V  (/?,  n) . 

In  particular,  the  power  function  of  the  WAP-LR  test  associated  with  wclr,n  con- 
verges (pointwise)  to  the  power  function  of  the  CLR  test  (as  N  -^  oo). 

In  hght  of  the  previous  paragraph,  it  seems  plausible  that  the  CLR  test  enjoys  an 
"admissibility  at  cx)"  property  reminiscent  of  Andrews  (1996,  Theorem  1(c)).  Veri- 
fying this  conjecture  is  not  entirely  trivial,  however,  because  the  unboundedness  (in 
t)  of  K.CLR*  [t]  ci)  makes  it  difficult  (if  not  impossible)  to  adapt  the  proofs  of  Andrews 
and  Ploberger  (1995)  and  Andrews  (1996)  to  the  present  situation. 

Theorem  3  also  suggests  a  method  of  constructing  tests  which  share  the  nice  nu- 
merical properties  of  the  CLR  test  and  furthermore  enjoy  demonstrable  optimality 
properties,  namely  tests  based  on  C^clr.n  (^s,t)  for  some  "large"  (possibly  sample- 
dependent)  value  of  N.  Because  the  power  improvements  (if  any)  attainable  in  this 
way  are  likely  to  be  slight,  the  properties  of  tests  constructed  in  this  way  are  not 
investigated  in  this  paper. 


Remark.  The  test  function  of  the  size  a  LM  test  is 

'{s'tf 


[s,t;a)  =  1 


t't 


>x;(i) 


where  Xo  (1)  is  the  1  —  a  quantile  of  the  x^  distribution  with  1  degree  of  freedom. 
AMS  show  that  a  one-sided  version  of  the  LM  test  can  be  interpreted  as  a  limit  of 
WAP-LR  tests  (and  is  locally  most  powerful  invariant).  On  the  other  hand,  we  are 
not  aware  of  any  such  results  which  cover  the  (two-sided)  LM  test. 
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A  slight  variation  on  the  argument  used  in  the  previous  subsection  can  be  used  to 
obtain  a  WAP-LR  interpretation  of  the  LM  test.  Indeed,  letting  Wlm,n  denote  the 
cdf  of  [Bn-,  N/ ^^v'bn^Bn]'  ^  where  the  distribution  of  N^/^Bn  is  uniform  on  [-1, 1] , 
it  can  be  shown  that" 


\s't\  l\fFt  =  hmA,_.oo  -TTJ 


iVl/6 


log  £^"-A^  (s,  t)-\-  —  -  VnVFi 


implying  in  particular  that 

n2 


(s'ty  _  1 


N  r—     r— 


(24) 


(25) 


4.     Proofs 
Proof  of  Theorem  1.  It  suffices  to  establish  (16) .  To  do  so,  it  suffices  to  show  that 
LI^AMs  (5,  t)  is  proportional  to  D"  (s,  i) .  Now, 


/s(5|0,0)/T(i|0,0) 


exp  (-\^  [lis  -  /?^f  -  ||s||-  +  \\t  -  (1  -  bH)  vrf  -  jjtf  ]") 


exp 


^J!Mll\  exp  [y\,\(is  +  (1  -  6/3)  tj'yr)  , 


where  ||-||  signifies  the  Eucfidean  norm,  A^r  =  tt'tt,  and  tt  =  A^^^^tt. 

Because  tt  has  unit  length,  it  follows  from  Muirhead  (1982,  Theorem  7.4.1)  that 


Lir'^Ms  {^s,t)    = 


fs{s\P,7r)fT{t\P,7r) 
/s(5|0,0)/r(t|0,0) 


dW^,jsif3,7r) 


exp 


Fi 


-■,-V'f3Q{S:t)V0 


dw{P,X), 


as  was  to  be  shown.  (We  are  grateful  to  a  referee  for  suggesting  the  use  of  Muirhead 
(1982,  Theorem  7.4.1).)         ■ 


Proof  of  Theorem  3.  The  result  is  obvious  if  Q  (s,  t)  =  0,  so  suppose  Q  (5,  t)  7^  0. 

^^ Details  arc  provided  in  an  Appendix,  available  from  the  authors  upon  request.  An  inspection  of 
the  proof  given  there  shows  that  the  distributional  assumption  on  N^/^Bn  is  made  for  concrcteness 
insofar  as  the  preceding  representations  arc  valid  whenever  N^/'^Bn  has  a  (fixed)  distribution  whose 
support  is  [—1,1]. 
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The  proof  will  make  use  of  the  fact  [e.g.,  Andrews  and  Ploberger  (1995,  Lemma 
2)]  that  0  <  C  <  C  <  oo,  where 


C  =  inf. 


A[;k/2;z/i] 


exp  (y^)  max(2, 1)   ^"    '' 
By  construction, 


C  =  sup^ 


,Fr[;k/2-z/4] 
exp(v^) 


C" 


{s,t) 


exp  I ;^—  )   0^1 


N 
=    exp  I  -  — 


2 


7;',-7V'i3Qis,t)ri0 


T  4 


dwr 


(/?,A) 


d$(/?) 


where  <J>  (•)  is  the  standard  normal  cdf.  Using  this  representation,  the  relation  (21) 
and  monotonicity  of  oi^i  [;  /c/2;  ■]  , 


£"'c^^«'~(s,t)exp 


TV 


^F^ 


d$(/3) 


'24  ^      ' 


<  oFx 

<  Cexp\VNCLR*is,t)\  , 


from  which  it  follows  immediately  that 
1 


limr 


^A^ 


N 


Iq^jTwclr.n  (5^ij  + 


<  CLR*[s,t) 


On  the  other  hand,  for  any  0  <  e  <  CLR*  {s,t) , 


(26) 


£"'"«.'^(s,t)exp(  y  )     = 


3^1 


(^,t) 


-;  jr?aQ(s,f)r?^ 


)i^i 


d«i>(/?) 

d$(/3) 


>    exp[\/A(CL/?*(s,f)-e: 


xCmaxlN  ■  CLR*  {s,ty  ,1] 


-(fc-l)/4 


rf$(/3) 


'.(s,i) 
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where 


B,  {s,  0  =  {/3  :  ^Jf]],Q{s,t)fi0  >  CLW  (.s,  t)  -  e], 

the  first  inequahty  uses  positivity  of  qFi  [;  k/2\  •]  ,  and  the  last  inequality  uses  (21)  and 
monotonicity  of  o-^i  [;  k/2\  ■]  .  The  integral  j^  .  ^,  d^  (/?)  is  strictly  positive  because 
$  (•)  has  full  support,  so 


lim/v^c 


nv 


log/:"'"«''^(s,i)  + 


A^ 


>  CLR*{s,t)-£. 


Letting  e  tend  to  zero  in  the  displayed  inequality,  we  obtain  an  inequality  which  can 
be  combined  with  (26)  to  yield  (22) . 
Indeed,  because 


sup(y _(,)'g/^  CLi?*  (s,t)  <  oo         and         inf^^/ ^/j'^/,^  /         (i$  (/?)  >  0 
for  any  compact  set  A'  C  M?^,  the  result  (22)  can  be  strengthened  as  follows: 


hm 


N-^oo^^^V(s>,t')'eK 


7n 


]C"-'^^'''^  {s,t)  + 


N' 


CLR*{s,t) 


0. 
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5.     Appendix:  Omitted  Proofs 
This  Appendix  provides  a  proof  of  (24)  and  a  derivation  of  (11)  —  (12) . 

5.1.  Proof  of  (24).  To  motivate  tlie  result,  notice  that  if  wlm,n  is  such  that  its 
support  consists  of  all  pairs  (/3,  A)  for  which  \0\  <  N'^^^  and  Xripij^  =  N,  then  the 
large  A^  behavior  of  the  integrand  in  (13)  is  reminiscent  of  the  behavior  of 


exp 


TV 


dFi 


kN 
2'T 


{t't  +  2ps't) 


Because 


maX|^l<;v->/3  \/t't  +  2(3s't    =     ^Vt  +  2N-y^\s't\ 


reasoning  similar  to  that  leading  to  Theorem  3  therefore  suggests  that  |s'^|  /Vt't  can 
be  represented  as  the  limit  as  A^  — >  oo  of  a  suitably  normalized  version  of  0"^m,n  (^^  i^ 

As  in  Section  3.3,  let  Wlm,n  denote  the  cdf  of  [B^ ,  N/  ^Jv'bn ^Ibn ]  '  where  the 
distribution  of  N^^^Bn  is  uniform  on  [—1,1].  Then  ,  . 


C"^'''"  {sj) 


exp )   oi^i 


A:    1 


—    exp 


N\  7V1/3 


)Fi 


|-_Af-i/3_Ar-i/3] 


d'WLM,N  (/?,  A) 


'  2'  4 


The  integrand  can  be  written  as 

0-^1 

where,  for  some  finite  constant  R  (depending  only  on  s  and  t), 


k   N 

-■-{t't  +  2l3s't  +  RN[^\s,t)) 


VpVis 


d/3. 


SUP|/3|<w-i/3  \Rn  iP;  S,t)\=  SUP|^|<^-i/3 


i'0Q{s,t)v0 


Nv'pVp 


{t't  +  2(3s't) 


<  N-^I^R. 
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As  a  consequence,  by  monotonicity  of  qFi  [;  /c/2;  •]  and  the  defining  property  of  C 
(appearing  in  the  proof  of  Theorem  3), 


£Wi^M,N  (^^  ^)  gj.p 


A^ 


<  4f  /  „A 

^    J\-\/Vn,i/Vn] 


k   N 
2'T 


[t't  +  2(5s't  +  7V-2/^i?) 


dp 


-     SUP|^|<yv-i/3    o-f^i 
=        0^1 


24V  / 


<    Cexp    ^N\t't  +  2N'^l^\s't\+N-^I^R   , 


from  which  it  follows  that 


7V-V6 


<    A^^i/*^  logC  +  N^/\/t't  +  27V-1/3  |5'f  I  +  7V-2/3i?  -  N^/^VVt 

R 


2yM 


where  the  last  inequality  uses  concavity  of  y^. 

To  obtain  an  inequality  in  the  opposite  direction,  we  distinguish  between  the  cases 
where  t  =  0  and  t  7^  0.  If  t  =  0,  then 
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7V-V6 


A^i/6 


A^         I—  I— 


)Fi 


[_/V-l/3,iV-V3] 


k   Nri'f^Q{s,t)r]p 


2'  4 


VpVf3 


dp 


>    N-^/^  oFi 


¥ 


O  {N-"^)  , 


which  completes  the  proof  of  (24)  in  the  case  where  t  —  Q. 

On  the  other  hand,  if  i  7^  0,  then  it  follows  from  monotonicity  and  positivity  of 
oFi  [;  k/2]  •]  that 


£t.^LM,N(5^^)exp 


N 


7V1/3 


Ari/3 


iF, 


l-iV-i/3,Ar-V3] 


fc     TV 

2'T 


[t't  +  2(5s't  -  N-^I^R) 


d/3 


5F1 


[o,yv-i/3] 


.L!l^t't  +  2p\s't\-N-~/m) 


dp 


for  every  A  >  iV  =  inf  { A  :  t't  -  2N~^'^  \s't\  -  N'^^^R  >  0}  .  Therefore,  for  any 
0  <  £  <  1  and  any  N  >  K, 


jru>LM,M   (5^  ^)  gj-p 


N 


iVi/3 


0-^1 


[(l~£)W-V3,iV-l/3] 


k_N 
2'T 


't't  +  2(3\s't\-N-''/'^R) 


dp 


^      9  "^f/3e[(l-e)A'-i/3,/v-i/3]    o-f^l 


^■j{t't  +  2P\s't\-N-'/m) 


3F1 


;  -;  -  (i'i  +  2  (1  -  e)  A-i/^  \s't\  -  iV^/^^R) 


>    -C\Nt't-N'i^R 


'/_/vi/37^ -('-''/' 


exp 


N\  t't  +  2  (1  -  e)  A-i/3  is'tl  -  A^-2/3i? 
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where  the  first  inequahty  uses  positivity  of  o^^i  [;  k/'2\  •]  and  the  last  inequality  uses 
the  defining  property  of  C_  (appearing  in  the  proof  of  Theorem  3)  and  monotonicity 

„f[.]-(fc-l)/4_ 

By  implication, 


Ar-i/6 


>    iV"i/6 


N  r-    I— 


log  {Cel2)  -  ^^-^  log  (TVt'i  -  N^I^'R) 


+N^'\lt't  +  2  (1  -  e)  yV^V3  \s't\  -  N-ym  -  N^^^VFt 
(1  -£)\s't\  -  N~^/'^R/2 


'ft  +  max  [2  (1  -  e)  A^-V3  \s't\  -  N-^m,  O] 
>    {l-e)\s't\/^t  +  0(N-^^HogN). 


+  0{N-^/^\ogN) 


The  proof  of  (24)  can  now  be  completed  by  letting  e  tend  to  zero  in  the  preceding 
display. 

5.2.      Derivation  of  (11)  —  ( 12).     The  test  based  on  (p  {S,  T)  is  a-similar  if  and  only 
if 

/      I    (p{s,t)fs{s\0,n)fT{t\0,n)dsdt  =  a        Vtt. 

Because  T  is  complete  under  Hq  :  (3  =  0,  the  preceding  condition  holds  if  and  only  if 

/    (f){s,t)  fs  {s\0,7r)ds  =  a 

./«'■■ 

for  almost  every  t  G  M.'^.  Furthermore,  fs  (s|0,7r)  does  not  depend  on  tt,  so  the  test 
based  on  0(5,  T)  is  a-similar  if  and  only  if 


/    ((>{s,t)fsis\0,0)ds  =  a 


for  almost  every  t  € 
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Using  (the  Fubini  theorem  and)  the  representation  (9) ,  we  can  write  (8)  as 


{s,t) 


fs{s\P,1T)fTit\(3,7T)dW{f3,Tv) 


dsdt. 


To  maximize  this  expression  subject  to  the  condition  that  the  test  based  on  0  (5,  T)  is 
a-similar,  we  can  proceed  on  a  "t  by  f  basis  and  consider  the  problem  of  maximizing 


subject  to  the  condition 


/5(5|/?,^)/T(t|/?,7r)diy(/?,7r) 


ds 


By  the  Neyman-Pearson  lemma,  this  maximization  problem  is  solved  by 


[s,t-a)  =  l\LR     {s,t)>KYi^{t;a: 


where 

~w       ,,,       J^,,,fs{s\P,n)fT{t\P,n)dW(A7r) 

—  w 
and  k^u  {t;  a)  is  the  1  -  a  quantile  of  the  distribution  of  LR     (2^.,  t) .  Finally,  because 

/r  (t|0,  0)  is  positive  we  obtain  the  equality 


^LRis,ta)  =  (l)^R{s,t]a) 


Date  Due 


